Identifying Risk Factors for Severe Childhood Malnutrition by Boosting Additive Quantile Regression
Abstract
Ordinary linear and generalized linear regression models relate the mean of a response variable to a linear combination of covariate effects and, as a consequence, focus on average properties of the response. Analyzing childhood malnutrition in developing or transition countries with such a regression model implies that the estimated effects describe the average nutritional status. However, it is of even greater interest to analyze quantiles of the response distribution, such as the 5% or 10% quantile, that relate to children's risk of extreme malnutrition. In this paper, we analyze data on childhood malnutrition collected in the 2005/2006 India Demographic and Health Survey based on a semiparametric extension of quantile regression models where nonlinear effects are included in the model equation, leading to additive quantile regression. The variable selection and model choice problems associated with estimating an additive quantile regression model are addressed by a novel boosting approach. Based on this rather general class of statistical learning procedures for empirical risk minimization, we develop, evaluate and apply a boosting algorithm for quantile regression. Our proposal allows for data-driven determination of the amount of smoothness required for the nonlinear effects and combines model selection with an automatic variable selection property. The results of our empirical evaluation suggest that boosting is an appropriate tool for estimation in linear and additive quantile regression models and helps to identify previously unknown risk factors for childhood malnutrition.
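To make the idea of boosting as empirical risk minimization for quantile regression more concrete, the following is a minimal sketch, not the authors' implementation. It assumes simple linear base learners (the paper's additive models use nonlinear effects), a fixed step length `nu`, and a fixed number of iterations `n_steps`; the function names `check_loss` and `boost_quantile` are hypothetical. In each iteration the negative gradient of the check loss is fitted by every covariate separately, and only the best-fitting component is updated, which is what gives boosting its variable selection property.

```python
import numpy as np

def check_loss(residual, tau):
    """Check (pinball) loss: rho_tau(r) = r * (tau - 1[r < 0])."""
    return residual * (tau - (residual < 0))

def boost_quantile(X, y, tau=0.1, n_steps=500, nu=0.1):
    """Component-wise boosting for linear quantile regression (illustrative).

    Assumption: linear base learners only; an intercept learner is included.
    Returns (intercept, coefficients) on the original covariate scale.
    """
    n, p = X.shape
    mu = X.mean(axis=0)
    # Design: a constant column plus mean-centred covariates
    Z = np.column_stack([np.ones(n), X - mu])
    coef = np.zeros(p + 1)
    coef[0] = np.quantile(y, tau)        # start at the empirical tau-quantile
    col_ss = (Z ** 2).sum(axis=0)
    for _ in range(n_steps):
        resid = y - Z @ coef
        # Negative gradient of the check loss w.r.t. the fitted values
        # (a subgradient is used where the residual is exactly zero)
        u = np.where(resid > 0, tau, tau - 1.0)
        # Least-squares fit of u on each column separately
        beta = Z.T @ u / col_ss
        rss = ((u[:, None] - Z * beta) ** 2).sum(axis=0)
        j = int(np.argmin(rss))          # best-fitting base learner
        coef[j] += nu * beta[j]          # update only that component
    return coef[0] - mu @ coef[1:], coef[1:]
```

Calling `boost_quantile(X, y, tau=0.05)` would target the 5% quantile discussed above. In practice the stopping iteration is usually chosen by cross-validation rather than fixed in advance, since stopping early is what keeps uninformative covariates out of the model.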
Similar resources
Boosting structured additive quantile regression for longitudinal childhood obesity data.
Childhood obesity and the investigation of its risk factors has become an important public health issue. Our work is based on and motivated by a German longitudinal study including 2,226 children with up to ten measurements on their body mass index (BMI) and risk factors from birth to the age of 10 years. We introduce boosting of structured additive quantile regression as a novel distribution-f...
Detection of risk factors for obesity in early childhood with quantile regression methods for longitudinal data
This article compares and discusses three different statistical methods for investigating risk factors for overweight and obesity in early childhood by means of the LISA study, a recent German birth cohort study with 3097 children. Since the definition of overweight and obesity is typically based on upper quantiles (90% and 97%) of the age specific body mass index (BMI) distribution, our aim wa...
Understanding Child Stunting in India: A Comprehensive Analysis of Socio-Economic, Nutritional and Environmental Determinants Using Additive Quantile Regression
BACKGROUND Most attempts to address undernutrition, responsible for one third of global child deaths, have fallen behind expectations. This suggests that the assumptions underlying current modelling and intervention practices should be revisited. OBJECTIVE We undertook a comprehensive analysis of the determinants of child stunting in India, and explored whether the established focus on linear...
Additive Models for Quantile Regression: Some New Methods for R
This brief report describes some recent developments of the R quantreg package to incorporate methods for additive models. The methods are illustrated with an application to modeling childhood malnutrition in India. Models with additive nonparametric effects offer a valuable dimension reduction device throughout applied statistics. In this paper we describe some recent developments of additive ...
Prediction intervals for future BMI values of individual children - a non-parametric approach by quantile boosting
BACKGROUND The construction of prediction intervals (PIs) for future body mass index (BMI) values of individual children based on a recent German birth cohort study with n = 2007 children is problematic for standard parametric approaches, as the BMI distribution in childhood is typically skewed depending on age. METHODS We avoid distributional assumptions by directly modelling the borders of ...